Picture for Lei Hou

Lei Hou

LecEval: An Automated Metric for Multimodal Knowledge Acquisition in Multimedia Learning

Add code
May 04, 2025
Viaarxiv icon

Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks

Add code
Apr 26, 2025
Viaarxiv icon

ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation

Add code
Mar 27, 2025
Viaarxiv icon

Invertible Koopman neural operator for data-driven modeling of partial differential equations

Add code
Mar 25, 2025
Viaarxiv icon

MRCEval: A Comprehensive, Challenging and Accessible Machine Reading Comprehension Benchmark

Add code
Mar 10, 2025
Viaarxiv icon

Sparse Auto-Encoder Interprets Linguistic Features in Large Language Models

Add code
Feb 27, 2025
Viaarxiv icon

Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Add code
Feb 26, 2025
Viaarxiv icon

LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models

Add code
Feb 20, 2025
Viaarxiv icon

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Add code
Dec 19, 2024
Figure 1 for LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
Figure 2 for LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
Figure 3 for LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
Figure 4 for LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
Viaarxiv icon

EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents

Add code
Dec 16, 2024
Figure 1 for EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents
Figure 2 for EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents
Figure 3 for EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents
Figure 4 for EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents
Viaarxiv icon